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Abstract: Computers are considered innovative in classrooms, raising expectations of increased 
cognitive learning outcomes or motivation with effects on Deeper Learning (DL). The "new medium", 
however, may cause cognitive overloads. Combined with gender-related variations in ability, 
self-efficacy or self-confidence, computers may even diminish learning effects. Our empirical 
study used a quasi-experimental design and the Intrinsic Motivation Inventory (IMI) to monitor 
efficacy in knowledge gain and motivation when using computer-aided versus textbook-based 
educational units. Our sample consisted of 393 eighth graders. One objective focused on gender effects 
associated with autonomous teacher-assisted learning via interactive software or an appropriate 
textbook. Both groups finished with a recapitulation with the teacher. A third group concluded 
a computer-aided lesson with a computer quiz. To provide evidence for DL we tested long-term 
memory after six weeks and examined its correlation with intrinsic motivation factors. In general, 
our intervention affected the girls' but not the boys' intrinsic motivation. We recorded significantly 
higher post-test scores in the textbook-based lesson, but the differences vanished in the retention test. 
The teacher-assisted consolidation phase increased long-term knowledge and positively intervened 
with the students' interest. Thus, we found evidence for DL. 

Keywords: Deeper Learning; e-Learning effectiveness; gender issues; intermedia comparison; 
learning motivation; hypertext; evaluation of e-Learning; cognitive load 


1. Introduction 

Compared to textbooks, computers are regarded as innovative new media in classrooms leading 
to a substantial increase in motivation scores and cognitive learning outcomes. Students seem to regard 
computers as a new tool which motivates them and promotes cognitive learning [1], although several 
problems could interfere with success in learning [2] by causing cognitive load problems [3]. 

Almost 30 years ago, Salomon assumed that students might see computers as toys and thus 
underestimate the importance of the subject matter [4]. On the other hand, students may regard print 
media as difficult-to-handle learning tools needing much concentration, while in contrast audios and 
films may offer relaxation rather than encouraging hard work; this may reduce the attention level for 
the subject matter and consequently reduce cognitive learning achievements. Meanwhile, schools are 
reaching the digital age. For the last ten years e-learning has been discussed not only as an enrichment 
of teaching materials but also as leading to improvement in personal skills, and to Deeper Learning 
(DL), a concept describing a more meaningful and self-dependent way of learning [5-7]. Therefore, 
teachers change their role-perception from a broadcaster of knowledge to a tutor and change their 
view of their teaching: students have to see themselves as the responsible persons for their learning [6]. 
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The present study analyses learning success and intrinsic motivation in the context of 
a self-regulated learning unit involving either textbooks or computers with varying amounts of 
teacher supervision. 

1.1. What is Hypermedia and Hypertext? 

The terms "hypermedia" and "hypertext" were originally defined in 1965 by Ted Nelson; 
"Hypertext is a method to create and access nonlinear text" [8] (p. 4). Hypermedia describes a version of 
hypertext by emphasizing multimedial aspects. Thus, hypermedia can be understood as a combination 
of the words "hypertext" and "multimedia." 

The primary characteristic of hypertext is the information unit labelled as a "node". A node 
is a basic unit used in computer science. From a semiotic point of view the type of the node is 
a concrete symbol. Thus a node is a central element in dividing information into chunks of knowledge. 
This may be pure text (single character, row or a passage) in the basic hypertext or a graphic, audio 
or video-element in hypermedia [9,10]. The greatest and most common usage of hypermedia is the 
World Wide Web (www): links connect information units (nodes), and organise the information via 
the relationships among the nodes [10]. 

Often the terms hypertext and hypermedia are used synonymously (e.g., [11]). Since we compare 
the effects of a text, presented in a book or in computer-aided multimedia, we decided to use the 
original term hypertext, although the multimedia aspects of the computer-aided learning environment 
could justify the use of the broader term hypermedia. 

1.2. Advantages of Learning with Hypertext 

1.2.1. Learning Motivation and Deeper Learning 

Computer-aided learning is deemed to support interest and learning success because of a learner's 
(inter-)activity (e.g., [12]). Dale (2008) reported "the relative newness and coolness" of computers [7]. 
Since the iPod assignments in his study did not feel like work to the students, it was easier to motivate 
students and draw them into the instructional process. 

There are complex computer programs that enable interactivity, such as communication with 
other learners or adaptive learning programs: common examples are "Wiki"s, collaborative websites 
like the encyclopaedia "Wikipedia" [13] or "LEO" (Link Everything Online) [14], an internet-based 
electronic dictionary with integrated training functions. In contrast to this, in conventional learning 
software, learners still have passive roles by mostly interacting with navigation features [15,16]. 
Just by itself, the dynamic of hypertexts promotes interactivity since the opportunity of self-regulated 
learning encourages exploration of the individual's interests. Thus hypertext is a constructivist learning 
environment, offering a suitable learning setting which specifically supports constructing knowledge 
and preventing boredom [17]. Information offered in nodes is not essential for the present text, 
although the information reached via nodes offers the possibility to deepen one's knowledge of a topic. 
Simply having freedom of choice may help to satisfy curiosity and thus foster joy of studying [18] 
(p. 195). Any hypertext offering the option to deal in more detail with the subject requires additional 
involvement with it, so that the outcome is less likely to be just linear learning but a networked 
thinking process. Students have to process the learning matter, resulting in meaningful knowledge 
unachievable by rote-learning alone [12]. 

Even if activity is reduced, a wide range of information of variable complexity relevant to one's 
interest is offered, thus providing individualised learning [12,15]. The need for navigation and the 
didactic interaction with feedback may support individual motivation and activate thinking abilities, 
intensify elaboration, permit creativity and cultivate learning by discovery [12]. 

More incidental advantages of online learning could be the increase of personal responsibility 
for learning and personal development, especially within the social communities of the platforms [6] 
(p. 311). These are valuable skills, summarised in the concept of Deeper Learning (DL): Learners are 
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not passive recipients. Piaget [19] puts emphasis on the importance of action and problem-solving 
in learning [20]. Learning does not occur by listening to the teacher in silence but is constructed 
through active participation. Similarly, Papert describes such cognitive autonomy as "intellectual 
self-determinism," as children "act as creators rather than consumers of knowledge" [21] (p. 12). 
Knowledge that is created in this way is crosslinked and meaningful, which is why learners forget less. 

Several factors enhancing DL have been identified: degree of interest in, the relevance of and 
the challenge provided by the subject content; a workload which is perceived as comfortable by 
students; provision of a framework which demonstrates interrelationships e.g. through the use of 
concept maps—or hypertexts; assessment instruments which reward Deeper Learning (e.g., a quiz), 
and student involvement in their own learning through the use of strategies such as group work or 
negotiation of topics for subject assessment tasks [22]. 

A learning environment fostering DL gives students the ability to practise their Learning Lab 
skills in the context of meaningful projects that foster critical thinking, problem solving, collaboration 
and learning per se [23]. According to Dale, the hypertext environment is particularly suitable for DL, 
meeting all these demands with the opportunity of self-regulated explorative learning [7]. 

Thus with self-regulated computer-aided learning we expect higher gain in intrinsic motivation 
factors such as interest, in turn associated with higher knowledge gain. 

1.2.2. Learning Success 

The developers Nelson and Engelbart, the fathers of hypertext, were inspired to evolve 
the hypertext technology by Bush's (1945) essay "As We May Think" [24], which describes 
a sort of collective memory machine, called "Memex" (Memory Extender). This precursor of 
hypertext-organised information in a manner comparable to the organisation structure of human 
memory as a semantic network of inter-connected concepts [24]. It was thus capable of changing a flood 
of information into knowledge. Similarly, Collins and Quillian [25] described a "semantic memory" as 
"the net-structure of hypertext reflecting the mental presentation of knowledge," as Ausubel did with 
his "assimilation theory of cognitive learning" [26]. This analogy of hypertext and human memory may 
support the learning process, as students need networked thinking for meaningful understanding [27], 
which is why hypertexts are regarded as an effective educational tool with the additional benefit of 
allowing conceptual learning. Similar to concept maps, hypertexts may provide a perfect tool for 
visual presentation of interrelations [28]. 

Computers also provide an opportunity to access multimedial and multicodal information for 
learners using various learning styles [29,30]. Hypermedial learning environments are therefore 
considered helpful tools to teach even complex, interdisciplinary subject matters with complex 
structured concepts, as in science education. The multimodality and multicodality may help the 
student to understand a structure, and may support cognitive processing [31]. Learning via nodes 
in a hypertext reveals relationships between different concepts: this helps learners in the active 
meta-cognitive progress of long-term knowledge, an essential part of meaningful learning—a step 
to DL. 

DL is "not memorising only to forget and it is not reciting or regurgitating what really is 
not understood and cannot be applied" [32]. DL is thought to lead to long-term learning because 
you forget less; meaningful knowledge is learned when intrinsically motivated, probably with fun. 
Computer-aided learning environments are considered to support DL, as students acquire knowledge 
by exploring according to their interest [6]. In accordance with PISA progressive teaching methods, 
like problem-based, inquiry-based learning, an individual and group project work can be used to 
foster self-regulated learning and deep understanding and prepare students to apply their knowledge 
in novel situations [33]. 
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1.3. Disadvantages 

As early as 1987 Conklin found two main disadvantages of hypertext: "disorientation" ("lost in 
hyperspace") and "cognitive overhead" [34] (p. 40), which may coincide with the theory of "cognitive 
(over-)load" [35]. Nowadays with web 2.0 the problem has become more complex than ever! 

Disorientation may occur with malfunctioning navigation design, supporting associative but 
not target-oriented exploration. This can lead to problems especially when hypertext data are highly 
complex, shallowly structured and/or a user has little pre-knowledge of handling computers and 
hypertext [36,37]: "Navigation in its narrow sense means to move through space; in its broader sense, 
navigation also includes virtual movement through cognitive space made up of data and of knowledge 
emerging from those data" [38] (p. 63). With inappropriate navigation design, a learner has no 
mental map of the hypertext, and may rapidly lose track where he/she is manoeuvring within the 
hypertext and where nodes interlink. Some learners, named "apathetic hypertext users" [39] will 
always get lost in hyperspace. "Knowledge-seekers" or "book lovers" navigate in a consequential, 
logical, systemic and strategic manner [39]. As the name suggests, these students could learn equally 
well with textbooks as with hypertext. "Feature explorers" or "resource junkies" are more interested in 
the style of the hypertext and what kind of screens it contains, than in the important substance of the 
written text. Those learners will use the hypertext exploratively. Workbook usage might remind such 
learners to take in information. 

Since the iPod assignments were not seen as work by the students, it was easier to motivate 
students and draw them into the instructional process [7]. On the other hand, this very point could be 
a disadvantage: Salomon describes an underestimation of the matter as students think "print is hard, 
television is easy" [4]. 

Another crucial problem hindering learning success may lie in cognitive overhead/overload. 
The "Generative Theory of Multimedia Learning" [40] maintains that learners are limited-capacity 
dual encoders who actively process information in order to integrate it meaningfully into their existing 
knowledge. Working memory is needed to integrate new information and handle the learning aid. 
Problematical at this juncture is the limited information capacity of the mental activity of working 
memory, labelled cognitive load (CL) (e.g., [35,41]). Sweller et al. [3] assume three CL components: 
(CL1) intrinsic load caused by content complexity, (CL2) extraneous load caused by the instructional 
mode, and (CL3) germane load necessary for individually processing information in preparation 
for long-term memory. As all three components are assumed to be additive [42], an increase of 
component (CL1) and/or (CL2) without a decrease of the other would lead to the loss of (CL3), causing 
Cognitive Overload. Reducing the capacity for (CL3) germane load would consequently reduce 
cognitive learning of the subject matter. Applied to the CL theory (CL2) extraneous load refers to the 
instruction design. Thus computers and textbooks could cause different CLs. A hypermedia design 
following the principles of multimedia learning can enhance a learner's achievement by avoiding 
cognitive overload [40,43]. The advantages of self-regulated and inquiry-based learning complement 
each other [43]. Many issues fundamental to the use of computers and hypermedia have been studied; 
preeminent are the principles of helpful hypermedia use, identified by Mayer [40]; for instance, 
avoiding split attention which might increase cognitive load [35]; additionally, the modality of 
information has been shown to have an effect on learning owing to its limitation of the working 
memory capacity [41,44], Probably computer handling is not a relevant cognitive load factor for 
"digital natives" [45]. Dale refers to the youth of today as the "iPod generation," one well prepared for 
e-learning [7]. 

1.4. Gender 

A number of studies of gender effects in hypermedia instruction have yielded inconsistent 
results; some report no differences at all [46], others report large differences with regard to interest 
in computers [47]. This may be an effect of the participants' age: children monitored within the 
project "Hole-in-the-Wall Education Ltd" (New Delhi, India) (HIWEL; www.hole-in-the-wall.com) 
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learned computer handling as a required skill (e.g., the internet language English) through practice. 
Adolescents and adults seem to have a natural "timidity", a scepticism towards new methods [48]. 
Previous studies have shown that educational software motivates girls less than boys [2]. This might 
be explained by the fact that designers of the software tend to be male. 

In the 1990s, studies in Germany reported boys to be more experienced with computers than 
girls [49]. Decker [50] drew the conclusion that boys have higher competence in this context, 
a conclusion that could have been premature. Furthermore, the rapid development of children's 
computer handling needs consideration. 

Our research questions were the following: 

• Do students learn with hypertext? 

• Do students gain more long-term knowledge with hypertext? 

• Does lack of teacher's tutoring effect long-term learning success? 

• Does autonomous learning without teacher assistance affect intrinsic motivation? 

• Does higher intrinsic motivation correlate with higher knowledge test results? 

• Are there gender differences in learning emotions and success between the learning environments? 

2. Materials and Methods 

2.1. The Teaching Unit 

Our study consisted of a quasi-experimental before-after-control-impact (BACI) design [51] (read 
more 2.4) with the instructional mode as the independent variable: 

(1) Computer-aided lesson with computer-aided consolidation phase (CALC) (n =51); 

(2) Computer-aided lesson with teacher-assisted consolidation phase (CALT) (n = 114); 

(3) Textbook-based lesson with teacher-assisted consolidation phase (TAL) (n = 172); 

(4) Control group, not participating in the unit (n = 55). 

We designed a computer-aided lesson about honeybees consistent with the Bavarian syllabus for 
the age group. The module "Honeybee—Social Insects" highlighted the following subjects and paid 
particular attention to evolutionary contexts: 

□ Superorganism "bee colony" (e.g., social organisation, bee language); 

□ Timelines (e.g., life history of a working bee); 

□ Products (e.g., beeswax); 

□ Matter for humans (e.g., economy). 

2.2. The Hypertext Design 

We used a common navigation bar used in web 2.0 today (Figure 1). Nodes already displayed on 
pages are marked by changed font colour. An additional site map comparable to the table of contents 
in the textbook helps create a mental map of the hypertext [37]. A glossary is implemented as a tabular 
overview of the most important terms, each linking to the corresponding screen. The glossary is not so 
much a navigational aid through the hypertext as a pointer to the learning target. A working book 
was accompanied by an optional guided tour through the hypertext, as recommended by Jonassen [9]. 
The working book supports the self-regulated learners to reach the educational objective. 
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Phan omen Insektenstaat 



Soziale Insekten 

Superoroanismus 

Mitlieder 

Arbeitsteiluno 

Woben 

Koevolution 

Bienentanz 


Quiz 


Phanomen Insektenstaat 


Soziale Ordnung im Bienenvolk 

Bis zu 50 000 Honigbienen leben gleichzeitig in einem Biencnstock auf engstem ftaum zusammen. 
Sie sammcln Nektar und Pollen, ziehen gemeinsam Larven groB, verteidigen den Stock und teilen 
sich viele andere . Nur bei ganz wenigen Tierarten wie Ameisen, Termiten, Wespen und 

auch Menschen leben vergleichbar viele Individuen in so enger Semeinsschaft zusammen. 

Zwar ist eine kleine Zahl - etwa 2% - der Insektenarten Staaten bildendodzr sozial, diese 
Minderheit macht aber iiber 50% der Masse aller Insekten auf der Erde ausl 

Wie solche Staaten in der der Evolution entstanden sind, 

wie sie organisiert sind, 

wie die Verstandigung erfolgt, 

das sind Fragen, auf die Wissenschaftler mit Hightechmethoden immer neue, immer 
verbliiffendere Antworten und immer wieder auch neue Fragen erhalten. 

Sehen wir uns an, was wir Menschen inzwischen iiber die Bienen wissen... 



Figure 1. Screenshot of the hypertext. 


2.3. Sample and Design 

We selected eighth graders (N = 393) at the top level ("Gymnasium") in Bavaria, Germany. A total 
of 20 biology classes participated in the study. This cohort already had sufficient experience with 
computers since information technology is part of the regular syllabus. The participants' mean age 
was 13.6 (SD = 0.63) years. The gender distribution revealed a higher number of boys (54% to 46%). 

A single teacher unknown to all students tutored all lessons. After a ten-minute introductory 
phase students worked cooperatively in groups of two, or three at most, for 60 min. The students 
could choose their team. Students with CALC and CALT used an informative internet homepage with 
pictures, texts, videos and animations. Students with TAL used an equivalent textbook, with the same 
texts and pictures as the homepage but with the obvious lack of multimodality. 

As a consolidation phase, the class reviewed the subject matter for ten minutes. In CALT and TAL, 
students recapitulated on the basis of a quiz presented with a PowerPoint presentation with teacher's 
tutoring. To exclude teacher's assistance in CALC, students used an equivalent computer-aided quiz 
(cf. Table 1). 


Table 1. Quasi-Experimental (BACI) Design a of the Study. 


Group 

Time Span 

Instruction 1—CALC 

Instruction 2—CALT 

Instruction 3—TAL 

Control 

Pre-test 


K1 

K1 

K1 

K1 

Delay 

Instruction b 

2 weeks 

70 min 

CAL 

CAL 

TAL 


Consolidation 

10 min 

Computer-Aided 

Teacher-Assisted 

Teacher-Assisted 


Post-test 

10 min 

K2 + IMI 

K2 + IMI 

K2 + IMI 

K2 

Delay 

Retention test 

6 weeks 

K3 

K3 

K3 

K3 


Note: a This design implied a one-way multivariate analysis of variance with repeated measures which had to 
be rejected on the grounds of the non-normal distribution of variables; b including a 10-min introduction. 


A workbook provided a guideline through the lessons. Questions in the workbook had to be 
answered autonomously by the team using research in the homepage and the textbook respectively. 
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Students were requested to complete their own workbook. There was no teacher support needed, 
except with regard to technical problems with the in-school computer network. 

2.4. Quasi-Experimental Before-After-Control-Impact Design (BACI) Design 

A study with students at schools in everyday conditions is comparable to an ecological "open field 
experiment": however strict the design, in educational settings there are always many uncontrollable 
factors. This is why we call this study "quasi-experimental". We use a before-after-control-impact 
design (BACI) (Smith [51]): We measure cognitive knowledge before and after the treatment. 

In our study we focused on the effect of media: a text was provided in a hypermedia environment 
using computers or with an equivalent textbook. In a third treatment group we excluded the teacher's 
tutoring completely. The consolidation phase was a quiz with computer feedback instead of the 
teacher-assisted recapitulation. We collected data on students' cognitive learning achievement and 
intrinsic motivation. We examined the effects of computer-aided and textbook-guided learning on 
cognitive knowledge and learning emotions, and took gender into account. 

2.5. Dependent Variable 
2.5.1. Knowledge Test 

Previous knowledge and changes in the level of knowledge were measured by means of a pre-test 
(Kl) two weeks before, a post-test (K2) immediately after the lesson and a retention test (K3) 6 weeks 
after that (Table 1). The questionnaire covered the content of the learning targets of the lesson. 
The 16 items were in multiple-choice format with three distractors and one correct answer (examples 
Table 2). Although multiple choice knowledge tests do not test meaningful learning this tool can test 
long-term learning, which is considered as an indicator for Deeper Learning [32]. 

Table 2. Examples for multiple-choice questionnaires. 


Subject Matter 

Question 

Multiple Choice Distractors 




□ 

By performing the "bee dance" 

(a) 

superorganism 
bee colony 

How does a working bee 

□ 

The bee smells like the food 

tell its colleague what food 
it has found? 

□ 

□ 

This is not told, only the location of the food 
With specific buzzing sounds 




□ 

Cleaning 

(b) 

timelines 

What is the first job of 

□ 

Doing sentry duty 

a new-born working bee? 

□ 

Collecting food 



□ 

Building comb 




□ 

Nectar 

(c) 

products 

Which material is not 

□ 

Honey dew 

collected by bees? 

□ 

Pollen 



□ 

Wax 


As students had hardly any knowledge (sum score M = 5.4 of 16, Figure 2), Kl had a low 
Cronbach's alpha = 0.05. Reliability was, however, acceptably high in K2 (Cronbach's alpha = 0.73) 
and quite good for the learning tests in K3 (Cronbach's alpha = 0.59). Generally, knowledge tests are 
difficult to test for reliability although reliability coefficients less than 0.6 are used for differentiating 
groups [52] (p. 213). 
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Gender 

■ boys 
□girls 


Figure 2. Knowledge sum scores of boys and girls for all instructions without control (Kl: pre-test; 
K2: post-test; K3: retention test; gender differences: ***: p < 0.001; **: p < 0.01). 


Item difficulty, defined as the percentage of correct answers [53] was normally distributed for 
the Kl, K2 and K3. Content validity of the knowledge test is given due to the syllabus-based subject. 
Furthermore, all knowledge items were constructed according to the learning goals of the intervention. 

Since the knowledge sum scores were not normally distributed (Kolmogorov-Smirnov with 
Lilliefors significance correction, p < 0.001), we used non-parametric tests and box plots. We presume 
that the different sample sizes and the quasi-experimental design of the study affected the non-normal 
distributed data (Shapiro-Wilk p = 0.000). 

2.5.2. Intrinsic Motivation Inventory (IMI) 

The "Intrinsic Motivation Inventory" (IMI) [54] is a multidimensional measurement device to 
assess participants' subjective experience related to a target activity (e.g., a lesson). IMI has been applied 
effectively in many experiments related to intrinsic motivation and self-regulation (e.g., [55-58]). 
We applied four of the six subscales of IMI: The subscale interest/enjoyment (IMI-I) is deemed to be 
the self-report measure of intrinsic motivation (seven items; e.g., "I enjoyed doing this activity very 
much."). The concept of perceived competence (IMI-C) is considered to be a positive predictor of both 
self-report and behavioural measures of intrinsic motivation (6 items; e.g., "I think I am pretty good at 
this activity."). The subscale effort (IMI-E) is relevant to some motivation questions and is thought to 
measure activity's relevance (five items; e.g., "I put a lot of effort into this."). The felt tension/pressure 
(IMI-T) is theorized to be a negative predictor of intrinsic motivation (five items; e.g., "I was very 
relaxed in doing this."). We used the German version of IMI, which has previously been successfully 
applied, for instance, by Girwidz et al. [1]. Students scored their answer on a five-digit Likert scale 
ranking from 1 (not true at all) to 5 (very true). The test was integrated in the post-test K2 after the 
knowledge questionnaire, and applied immediately after the intervention (Table 1). 

2.6. Analysis 

For statistical analyses we used SPSS 21.0 (IBM SPSS Statistics, IBM Deutschland GmbH, Ehningen, 
Germany). Due to the partially non-normally distributed variables (Kolmogorov-Smirnov test: p > 0.05) 
we used non-parametric analysis. Knowledge items were scored as correct (1 point) or incorrect 
(0 point) and summed to yield a score. In the case of IMI, we calculated mean scores for each subscale. 
We calculated inter-group differences with the Friedman test (FT) and between-group differences 
with Mann-Whitney U test (MWU) for two independent samples, and the Kruskal-Wallis test (KW) to 
analyse k independent samples. The significance of learning success was calculated using the Wilcoxon 
test (WT). 
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3. Results 

3.1. Cognitive Achievement 

There was no test effect as the control group did not gain knowledge (FT: Chi-Square = 2.995, df = 2, 
p = 0.224). The K1 showed neither differences between treatment groups (KW Chi-Square = 2.769, 
df = 3, p - 0.429) nor between the genders (MWU: Z = —0.864, p = 0.388). 

Students of TAL gained significantly higher sum scores than students of CALT immediately 
after the lesson (MWU: K2 Z = —4.430, p < 0.001; Figure 3— K2) and yielded the highest knowledge 
scores among all students. After six weeks both groups achieved equivalent cognitive sum scores 
(MWU: K3 Z = —1.098, p = 0.272; Figure 3 —K3). In CALT as well as in TAL girls gained significantly 
higher sum scores than boys in K2 (MWU: CALT Z = —2.385, p = 0.017; TAL Z = —3.784, p < 0.001; 
Figure 3 —K2). This was still found after six weeks in K3 (MWU: CALT Z = —3.777, p < 0.001; 
TAL Z = —2.339, p = 0.019; Figure 3— K3). Girls forgot less knowledge after six weeks with CALT, 
which caused significant differences in oblivion rates (K3-K2) (girls MWU: Z = —3.393, p = 0.001; 
all MWU: Z = -3.036, p = 0.002; Figure 4). 



Figure 3. Knowledge sum scores after gender. CALC: computer-aided learning with computer-aided 
consolidation; CALT: computer-aided learning with teacher-assisted consolidation; TAL: textbook 
learning with teacher-assisted consolidation; Kl: pre-test; K2: post-test; K3: retention test; black asterisk: 
treatment effects; blue asterisk: gender effects; ***: p < 0.001; **: p < 0.006; *: p < 0.015). 


All students learned with computers in both CALC and CALT. Cognitive achievement 
immediately after the lesson was equivalent independent of the consolidation phase of CALC or 
CALT (MWU: K2 Z = —1.12, p = 0.265). In respect to long-term learning, students of CALT gained 
significantly higher knowledge scores than students of CALC (MWU: Z = —2.23, p = 0.026). 

The lack of teacher guidance is disadvantageous notably for girls: in CALC—without 
teacher-guided consolidation phase—all students, even the girls, yielded lower knowledge scores in 
K3 than in CALT or TAL. The treatment effect is significant for girls with TAL (K2 MWU: Z = —2.858, 
p = 0.004). But there is no treatment effect with the boys. Accordingly, gender effects were reduced 
in CALC: there is no gender effect in K3 (MWU: Z = —0.949, p = 0.343) and hardly any in K2 (MWU: 
K2 Z = -2.065, p = 0.039). 

This finding is remarkable as girls seemed to learn more in general as both genders have similar 
pre-knowledge scores (MWU Kl: Z = —0.66, p > 0.5), yet girls gain higher sum scores than boys in K2 
(MWU K2: Z = -2.685, p = 0.007) and K3 (MWU K3: Z = -3.974, p = 0.000; Figure 2). 

For the indication of DL we are interested in the oblivion rate, which is calculated from the 
difference of K3 and K2. There is a significant treatment effect with lowest oblivion rates found with 
CALT (Table 3; Figure 4). Girls forgot most with TAL, whereas boys forgot more with CALC. 
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Figure 4. Oblivion (K3-K2) after gender. CALC: computer-aided learning with computer-aided 
consolidation; CALT: computer-aided learning with teacher-assisted consolidation; TAL: textbook 
learning with teacher-assisted consolidation). 

Table 3. Statistic Data of the Knowledge Tests. 


Effect 

Treatment 

Test 

Statistics 

Test Effect 

Control 

Kl, K2, K3 

FT: Chi-Square = 2.995, df = 2, p = 0.224 



K1 

KW Chi-Square = 2.769, df = 3, p = 0.429 

Treatment Group 

TAL—CALT—CALC 

K2 

KW Chi-Square = 21.141, df = 2, p < 0.000 

Differences 

K3 

KW Chi-Square = 11.298, df = 2, p = 0.004 



Oblivion 

KW Chi-Square = 17.456, df = 2, p < 0.000 



K2 

MWU: Z = -4.430, p < 0.000 


TAL—CALT 

K3 

MWU: Z = -1.098, p = 0.272 



Oblivion 

MWU: Z = -3.036, p = 0.002 



K2 

MWU: K2 Z = -1.12, p = 0.265 


CALT—CALC 

K3 

MWU: Z = -2.23, p = 0.026 



Oblivion 

MWU: Z = -3.963, p < 0.000 



K2 

MWU: Z = -21.141, p < 0.000 


TAL—CALC 

K3 

MWU: Z = -11,298, p = 0.004 



Oblivion 

MWU: Z = -17.456, p < 0.000 

Girls' Treatment 
Effects 


K2 

KW Chi-Square = 24.070, df = 2, p < 0.000 

TAL—CALT—CALC 

K3 

KW Chi-Square = 9.802, df = 2, p = 0.007 


Oblivion 

KW Chi-Square = 12.213, df = 2, p = 0.002 


TAL—CALT 

K2 

MWU: Z = -3.915, p < 0.000 


Oblivion 

MWU: Z = -3.393, p = 0.001 


CALT—CALC 

K3 

MWU: Z = -2.858, p = 0.004 


TAL—CALC 

K2 

MWU: Z = -4.024, p < 0.000 


K3 

MWU: Z = -3.106, p = 0.002 

Boys' Treatment 
Effects 

TAL—CALT—CALC 

K2 

KW Chi-Square = 12.890, df = 2, p = 0.002 


TAL—CALT 

K2 

MWU: Z = -3.418, p = 0.001 


K3 

MWU: Z = -2.457, p = 0.014 


CALT—CALC 

K2 

MWU: Z = -2.774, p = 0.006 


Oblivion 

MWU: Z = -2.488, p = 0.013 


TAL—CALC 


No sig. differences! 
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Table 3. Cont. 


Effect 

Treatment 

Test 

Statistics 




Kl 

MWU: Z = 

-0.864, p = 0.388 

Gender Effects 

all treatments 

K2 

MWU: Z = 

-2.685, p = 0.007 



K3 

MWU: Z = 

-3.974, p < 0.000 


TAL 

K2 

MWU: Z = 

-3.784, p < 0.001 


K3 

MWU: Z = 

-2.339, p = 0.019 


CALT 

K2 

MWU: Z = 

-2.385, p = 0.017 


K3 

MWU: Z = 

-3.777, p < .001; 


CALC 

K2 

MWU: Z = 

-2.065, p = 0.039 


K3 

MWU: Z = 

-0.949, p = 0.343 


Kl: pre-knowledge test; K2: post-knowledge test; K3: retention knowledge test KW: Kruskal Wallis test; 
MWU: Mann-Whitney U test; FT: Friedman test; Oblivion: lost knowledge = K3—K2. 


3.2. Intrinsic Motivation Inventory (IMI) 

We found the usual acceptable Cronbach's alpha for interest/enjoyment IMII (seven items; 
Cronbach's alpha = 0.744), perceived competence IMI-C (six items; Cronbach's alpha = 0.673) and 
tension/pressure IMI-T (five items; Cronbach's alpha = 0.622). We rejected the subscale Effort IMI-E due 
to a questionable Cronbach's alpha (five items; Cronbach's alpha = 0.545). Furthermore, this subscale 
is not affected at all. 

There were no significant treatment effects in the IMI subscales without correlation to gender. 
However, in the IMI-I (interest), students yielded the highest mean scores in CALT (Figure 5: IMI 
interest). This is due to the fact that girls' intrinsic motivation is affected by the treatments but not 
the boys'. The IMI-I mean scores are significantly lower in CALC than in CALT (MWU: Z = —2.744, 
p = 0.006) or in TAL (MWU: Z = —1.794, p = 0.073). The difference between CALT and TAL is not 
significant (MWU: Z = —1.448, p = 0.148). This difference between CALC and CALT in IMI-1 is affected 
by the girls' increase in IMI-I. Although girls are more interested in the subject with CALT than with 
TAL (not significant) they are significantly less interested with CALC (MWU: CALC CALT: Z = —2.130, 
p = 0.033). 

Girls' perceived competence IMI-C is higher with CALC than with CALT (MWU: Z = —2.300, 
p = 0.021) but equivalent in CALT and TAL. 

Lastly, girls reported least tension (IMI-T) in TAL with significant difference to CALT (MWU: 
Z = —3.075, p = 0.002) but IMI-T is similar in CALC and CALT. The boys reported similar intrinsic 
motivation in all treatments. 


_&_IT -j- 

IB ill 

Jj— Ji 1 a_- 


! IF IB 


CALC CALT TAL CALC CALT TAL 



Figure 5. Intrinsic Motivation Inventory grouped by treatment, selected by gender: Interest (* p = 0.033), 
Competence (*p = 0.021), Tension (*p = 0.002) grouped into treatment, selected by gender; CALC: 
computer-aided learning with computer-aided consolidation; CALT: computer-aided learning with 
teacher-assisted consolidation; TAL: textbook learning with teacher-assisted consolidation. 
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4. Discussion 

Our examination of the effects of CAL and equivalent TAL in biology lessons focused on 
the differences in cognitive achievement and the intrinsic motivation with regard to gender. 
An underestimation of the subject matter's difficulty [4] was excluded by the regular CAL in the 
eighth grade and workbook usage. 

In line with Schaal [59] we found the highest mean scores for interest for the subject matter 
in CALT. This is notable as there can hardly be any novelty effect as participants were sufficiently 
accustomed to CAL. However, in contrast to Schaal [59], but in line with Conradty and Bogner [60], 
the higher interest scores did not result in higher knowledge scores in K2. This may result from 
learning-inhibiting factors, namely extraneous load (CL2) [3]. But firstly, all students had sufficient 
experience in computers because of regular IT lessons. Secondly, students reported highest perceived 
competence (IMI-C) in CALC. As the software design followed the principles of multimedia learning, 
we conclude that cognitive overload was reduced to a large extent [61]. 

Contrary to the expectations of Dillon and Gabbard [29] and Rasmussen and Davidson-Shivers [30], 
the multicodality of CAL with its opportunity of appealing to various learning styles did not enhance 
cognitive achievements in K2. However, retention rate was higher in CALT than in TAL, indicating 
that computer-aided learning could support learning. Thus the oblivion rate was lowest with CALT, 
indicating a meaningful learning success. Students learning with CALT learned less but then forgot 
less. This could be caused by the fact that interest in the subject (IMI-I) with CALT is higher, probably 
owing to the use of multimedia such as teacher's presence. The learning success we monitored is 
in line with the findings of Dale [7] who perceives this learning success as DL, according to Askew 
and Carnell [62]. It could be argued that the augmentation of personal responsibility for learning 
and personal development [6] is in the tradition of Piaget [19], who challenged the importance of 
action and problem solving in learning [20] (p. 9). In other words, learning is constructed through 
active participation as children engage in the learning process to act out solutions to problems faced. 
Similarly, Papert [21] describes such cognitive autonomy as "intellectual self-determinism," as children 
"act as creators rather than consumers of knowledge." 

Our results point to the relevance of the teacher according to Conradty [60]: CAL as 
an instructional enrichment increased a student's interest in the subject matter. Without the 
teacher-assisted consolidation phase the perceived competence was higher but both interest and 
cognitive achievement were reduced dramatically. The role of the teacher is vital for long-term learning. 
The mixture of "intellectual self-determinism" [21] and the social contact with a teacher in the role 
of a tutor instead of a "broadcaster of knowledge" fosters long-term learning success [60], as we 
found in CALT. Authentic learning is embroiled with issues of socialization and identity, not only-but 
especially-with teenagers, and it would be a mistake to attempt to reify or separate these. 

In line with Dillon and Jobst [63], we found gender effects with participants older than fifth graders. 
At this age, students are adolescent, with gender-related behaviour determined by their already 
developed gender identity [64]. As girls learned more in this study, this might be due to their preference 
for the subject "biology": for years, about 63%-66% of the first-semester students in biology have been 
women [65,66]. In general, girls' motivation and learning success was affected by our intervention, 
but not the boys'. In particular, in CALC learning about the bee dance failed for both genders; indeed, 
girls learned even less than the boys, reflecting perfectly their loss in interest. Probably the lack of 
contact with the teacher served to decrease interest scores even more than the computer increased 
them. These findings contradict Passing and Levin [2] and Conradty [60] whose educational software 
did not motivate girls, whereas in our study, only girls' learning emotions were affected. This might be 
explained by the software and textbook which in our case were designed by a female designer. But in 
both the present study and previous work of Conradty and Bogner [60], the teachers were women. 

A good reason for a computer implementation in regular lessons is the students' opportunity to 
learn required skills casually as monitored within the project "Hole-in-the-Wall Education Ltd" [48,67]. 
These incidental skills are not only computer handling or a foreign language but also DL skills. 
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According to Turvey [6] the type of long-term learning is one that is based upon a deep understanding, 
actively created by students taking responsibility for their learning consistent with their interests. 

5. Conclusions 

In the present study, our findings provide evidence that students who are used to computers 
can adequately learn with them. Computers may provide instructional enrichment by increasing 
a student's interest in the subject matter and providing additional skills such as the ability for Deeper 
Learning. Therefore, computer learning supports meaningful and long-term learning. Important 
for both interest and learning is the subsequent tutoring by a teacher with a refined role-perception. 
With respect to gender in our study, it was the girls' intrinsic motivation who were most affected. 
Notably, girls' learning success with CALT—computer-aided learning with teacher tutoring—increased 
with their interest in the subject matter, though this was not the case with the boys. Therefore, our results 
could be viewed as reflecting the Deeper Learning and motivating effects of CALT. Further studies 
should focus on the effect of teacher's contact on students' interest and learning effect. Continuative 
indicators for Deeper Learning have to be specified to design appropriate studies. 
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Abbreviations 

The following abbreviations are used in this manuscript: 


CL 

Cognitive load 

IMI 

Intrinsic Motivation Inventory 

CAL 

Computer-aided learning 

CALC 

Computer-aided learning with computer-centred consolidation phase 

CALT 

Computer-aided learning with teacher-assisted consolidation phase 

TAL 

Textbook-based learning with teacher-assisted consolidation phase 

K1 

pre-test two weeks before intervention 

K2 

post-test immediately after intervention 

K3 

retention test six weeks after intervention 

DL 

Deeper Learning 

MWU 

Mann-Whitney U test 

FT 

Friedman test 
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